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DETAILED ACTION 

1 . This action is responsive to the Request for Continued Examination filed on 01/14/2009. 

2. The amendment to the Specification has been entered. 

3 . The Declaration filed on 04/28/2009 has been entered. 

4. Claims 7, 20 and 30 have been previously cancelled. 

5. Claims 1-6, 8-19, 21-29 and 31-38 are being allowed. 

EXAMINER'S AMENDMENT 

6. An examiner's amendment to the record appear below. Should the change and/or 
additions be unacceptable to the Applicant, an amendment may be filed as provided by 37 CFR 
1.312. To ensure consideration of such amendment, it MUST be submitted no later than the 
payment of issue fee. 

Applicant filed Request for Continued Examination on 01/14/2009 to amend the 
Specification to remove the claim of priority to U.S. Application Serial No. 10/406,709 filed on 
April 3, 2003. Applicant submitted claims 1-6, 8-19, 21-29 and 31-38. 

Authorization for examiner's amendment was given in a telephone interview with Martin 
J. Sultana, Registration No. 57,739, on 12/04/08 to put the claims in condition for allowance. A 
proposed amendment has been received and adopted by the Examiner - See below. 

IN THE CLAIMS: 

This listing of claims below will replace all prior versions, and listings, of claims in the 
application. 
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1 . (currently amended) A trace processor configured to support parallel execution of 
multiple instructions, the trace processor comprising: 

a trace detector that identifies traces in a segment of code including successive 
instructions, each of multiple identified traces in the segment of code including a set of 
instructions capable of being executed on an execution unit; 

a dependency detector that, prior to parallel execution of multiple identified traces on 
corresponding execution units, analyzes the traces identified in the segment of code to determine 
a dependency order for executing the traces, the dependency order identifying at least one of the 
traces associated with the segment of code that cannot be property executed in parallel with 
another trace in the segment of code; 

a trace scheduler coupled to the dependency detector and the trace detector, the trace 
scheduler receiving a set of traces and, based on the dependency order, causing the 
corresponding; execution units to execute traces within the set of traces in parallel, the execution 
taking place in an execution order that is based on the identified dependency order, at least two 
traces bcinu executed in parallel and if the dependency order indicates that a second trace is 
dependent upon a first trace, the first trace being executed prior to the second trace; 

multiple execution units to execute multiple traces in parallel based on the dependency 

order; 

a buffer to temporarily store results associated with execution of multiple executed traces; 
and 

a comparator circuit that, at run time of executing the multiple traces in parallel, identifies 
an out-of-order memory dependency condition associated with parallel executed traces resulting 
in an error; and 

the comparator circuit, in response to identifying the out-of-order memory dependency 
condition: 

squashes execution of latter traces in the segment of code that depend on results from 
earlier traces; and 

clears results in the temporary buffer associated with the squashed traces. 
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2. (previously presented) The trace processor as in claim 1 , wherein the traces each include 
a sequence of contiguous instructions intended to be executed 

successively in time and the dependency order indicates which of the multiple traces must be 
executed before others identified in the segment of code. 

3. (currently amended) The trace processor as in claim 2 , wherein further comprising: 
[[a]] the scheduler [[that]] schedules parallel execution of traces detected within a basic 

block of JAVA code on multiple execution units according to the dependency order. 

4. (previously presented) The trace processor as in claim 1 , wherein the trace detector 
identifying traces in the segment of code includes identifying operand stack dependencies 
associated with portions of the segment of code and wherein the corresponding execution units 
each include an operand stack. 

5. (previously presented) The trace processor as in claim 1, wherein the dependency 
detector analyzes the traces to determine data dependencies associated with traces in the segment 
of code and identifies the dependency order for executing at least some of the traces in parallel at 
run time. 

6. (previously presented) The trace processor as in claim 1 wherein the comparator circuit, 
in response to identifying the out-of-order memory dependency condition, reschedules squashed 
traces for later execution. 

7. (canceled) 

8. (previously presented) The trace processor as in claim 1, wherein at least one of the 
traces is processed to include a folded bytecode instruction replacing a corresponding sequence 
of bytecode instructions. 



9. 



(previously presented) The trace processor as in claim 1 further comprising: 
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a fetcher that fetches multiple code instructions from different traces identified in the 
segment of code; 

a decoder that decodes the multiple fetched code instructions into corresponding 
bytecode instructions; and 

a buffer unit to store the bytecode instructions associated with the multiple decoded code 
instructions in corresponding trace buffers for each trace. 

10. (previously presented) The trace processor as in claim 1 wherein the comparator circuit 
identifies an out of order memory dependency condition based on a search for: 

i) a READ after a WRITE to the same memory address for different parallel executed 

traces, 

ii) a WRITE after a READ to the same memory address for different parallel executed 
traces, and 

iii) a WRITE after a WRITE to the same memory address for different parallel 
executed traces. 

1 1 . (previously presented) The trace processor as in claim 1 , wherein the dependency 
detector analyzing the traces in the segment of code determines a dependency order by 
comparing memory access instructions in a first trace to memory access instructions in other 
traces identified in the segment of code to identify a potential trace dependency in which the first 
trace contains a memory access instruction that depends on the operation of another memory 
access instruction in at least one of the other traces identified in the segment of code. 

12. (previously presented) The trace processor as in claim 1 , wherein the trace detector 
identifying traces within the segment of code: 

identifies a beginning trace instruction in the segment of code whose operation 
corresponds to a first clean condition of an execution unit; 

identifies a subsequent trace instruction in the segment of code whose operation 
corresponds to a non-clean condition of the execution unit; and 
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identifies an ending trace instruction in the segment of code whose operation 
follows the first clean condition and the non-clean condition of the execution unit and that 
corresponds to at least one of: 

i) a second clean condition of the execution unit; and 

ii) an end of the segment of code; and 

designates, as a trace within the segment of code, all instructions in the segment 
of code including, and in-between, the beginning trace instruction and the ending trace 
instruction. 

13. (previously presented) The trace processor as in claim 1 , wherein the dependency 
detector identifying the dependency order, upon completion of execution of at least two traces, 
updates the dependency order to remove any trace dependencies associated with other non- 
executed traces that depended on completion of execution of the at least two executed traces. 

14. (currently amended) A method associated with parallel execution of multiple 
instructions, the method comprising: 

identifying traces in a segment of code including successive instructions, each of multiple 
identified traces in the segment of code including a set of instructions capable of being 
executed on an execution unit; 

prior to parallel execution of multiple identified traces on corresponding execution units, 
analyzing the traces identified in the segment of code to determine a dependency order for 
executing the traces, the dependency order identifying at least one of the traces associated with 
the segment of code that cannot be properly executed in parallel with another trace in the 
segment of code; 

receiving a set of traces and, based on the dependency order, to execute traces within the 
set of traces in parallel, the execution taking place in an execution order that is based on the 
identified dependency order, at least two traces being executed in parallel and if the dependency 
order indicates that a second trace is dependent upon a first trace, the first trace being executed 
prior to the second trace; 
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executing multiple traces in parallel based on the dependency order; 
temporarily storing results associated with execution of the multiple traces in a temporary buffer; 
and 

at run time, identifying an out-of-order memory dependency condition associated with 

parallel executed traces resulting in an error; and 

in response to identifying the out-of-order memory dependency condition: 

squashing execution of latter traces in the segment of code that depend on results from 

earlier traces and clearing results in the temporary buffer associated with the squashed traces. 

15. (original) A method as in claim 14, wherein the traces each include a sequence of 
contiguous instructions intended to be executed successfully in time and the dependency order 
indicates which of the multiple traces must be executed before others identified in the segment of 
code. 

16. (original) A method as in claim 15 further comprising: scheduling parallel execution of 
traces detected within a basic block of JAVA code on multiple execution units according to the 
dependency order. 

17. (original) A method as in claim 14, wherein identifying traces in the segment of code 
includes identifying operand stack dependencies associated with portions of the segment of code 
and wherein the corresponding execution units each including an operand stack. 

18. (original) A method as in claim 14, wherein analyzing the traces includes determining 
data dependencies associated with traces in the segment of code to identify the dependency order 
for executing at least some of the traces in parallel at run time. 

19. (previously presented) A method as in claim 14 further comprising: rescheduling 
squashed traces for later execution. 



20. (canceled) 
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21 . (original) A method as in claim 14, wherein at least one of the traces is processed to 
include a folded JAVA bytecode instruction replacing a corresponding sequence of JAVA 
bytecode instructions. 

22. (original) A method as in claim 14 further comprising: 

fetching multiple code instructions from different traces identified in the segment 
of code; 

decoding the multiple fetched code instructions into corresponding bytecode instructions; 

and 

storing the bytecode instructions associated with the multiple decoded code instructions 
in corresponding trace buffers for each trace. 

23. (previously presented) A method as in claim 14 wherein identifying an out of order 
memory dependency condition includes searching for: 

i) a READ after a WRITE to the same memory address for different parallel 
executed traces, 

ii) a WRITE after a READ to the same memory address for different parallel executed 
traces, and 

iii) a WRITE after a WRITE to the same memory address for different parallel executed 

traces. 

24. (original) A method as in claim 14, wherein analyzing the traces in the segment of code 
to determine a dependency order includes: 

comparing memory access instructions in a first trace to memory access instructions in 
other traces identified in the segment of code to identify a potential trace dependency in which 
the first trace contains a memory access instruction that depends on the operation of another 
memory access instruction in at least one of the other traces identified in the segment of code. 
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25. (original) A method as in claim 14, wherein identifying traces within the segment of 
code comprises: 

identifying a beginning trace instruction in the segment of code whose operation 
corresponds to a first clean condition of an execution unit; 

identifying a subsequent trace instruction in the segment of code whose operation 
corresponds to a non-clean condition of the execution unit; and 

identifying an ending trace instruction in the segment of code whose operation follows 
the first clean condition and the non-clean condition of the execution unit and that corresponds to 
at least one of: 

i) a second clean condition of the execution unit; and 

ii) an end of the segment of code; and 

designating, as a trace within the segment of code, all instructions in the segment of code 
including, and in-between, the beginning trace instruction and the ending trace instruction. 

26. (original) A method as in claim 14, wherein identifying the dependency order further 
comprises: 

upon completion of execution of at least two traces, updating the dependency order to 
remove any trace dependencies associated with other non-executed traces that depended on 
completion of execution of the at least two executed traces. 

27. (currently amended) A computer program product including a computer-readable 
medium having instructions stored thereon for processing data information, such that the 
instructions, when carried out by a processing device, enable the processing device to perform 
the steps of: 

identifying traces in a segment of code including successive instructions, each of multiple 
identified traces in the segment of code including a set of instructions capable of being executed 
on an execution unit; 

prior to parallel execution of multiple identified traces on corresponding execution units, 
analyzing the traces identified in the segment of code to determine a dependency order for 
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executing the traces, the dependency order identifying at least one of the traces associated with 
the segment of code that cannot be properly executed in parallel with another trace in the 
segment of code; 

receiving a set of traces and, based on the dependency order, to execute traces within the 
set of traces in parallel, the execution taking place in an execution order that is based on the 
identified dependency order, at least two traces being executed in parallel and if the dependency 
order indicates that a second trace is dependent upon a first trace, the first trace being executed 
prior to the second trace; 

executing multiple traces in parallel based on the dependency order; 

temporarily storing results associated with execution of the multiple traces in a temporary 
buffer; and 

at run time, identifying an out-of-order memory dependency condition associated 
with parallel executed traces resulting in an error; and 

in response to identifying the out-of-order memory dependency condition: 

squashing execution of latter traces in the segment of code that depend on results 
from earlier traces and clearing results in the temporary buffer associated with the squashed 
traces. 

28. (currently amended) A trace processor configured to support parallel execution of 
multiple instructions, the trace processor comprising: 

means for identifying traces in a segment of code including successive 
instructions, each of multiple identified traces in the segment of code including a set of 
instructions capable of being executed on at least one execution unit; 

means for analyzing the multiple identified traces identified in the segment of 
code, prior to parallel execution of the multiple identified traces on corresponding execution 
units, to determine a dependency order for executing the multiple identified traces, the 
dependency order identifying at least one of the traces associated with the segment of code that 
cannot be properly executed in parallel with another trace in the segment of code; 
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means for receiving a set of traces and, based on the dependency order, to execute traces 
within the set of traces in parallel, the execution taking place in an execution order that is based 
on the identified dependency order, at least two traces being executed in parallel and if the 
dependency order indicates that a second trace is dependent upon a first trace, the first trace 
being executed prior to the second trace; 

a temporarily buffer coupled to the execution units to store results associated with 
execution of multiple traces of the segment of codes; 

a comparator circuit to detect whether an out-of-order memory dependency condition 
associated with parallel executed traces occurs at run-time of executing the multiple 
traces in parallel, the comparator circuit conditionally loading the results stored in the 
temporary buffer to memory after particular trace completes execution; and 
squashing execution of latter traces in the segment of code that depend on results from 
earlier traces; and 

clears results in the temporary buffer associated with the squashed traces. 

29. (previously presented) A trace processor configured to support parallel execution of 
multiple instructions, the trace processor comprising: 

a fetcher to fetch instructions; 

a trace detector coupled to receive the fetched instructions, the trace detector 
identifying traces in a segment of code including successive instructions, each of multiple 
identified traces in the segment of code including a set of instructions capable of being executed 
on an execution unit; 

a dependency detector that, prior to parallel execution of multiple identified traces 
on corresponding execution units, analyzes the traces identified in the segment of code to 
determine a dependency order for executing the traces, the dependency order identifying at least 
one of the traces associated with the segment of code that cannot be properly executed in parallel 
with another trace in the segment of code; 
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a trace scheduler coupled to the dependency detector and the trace detector, the 
trace scheduler receiving a set of traces and, based on the dependency order, causing the 
corresponding execution units to execute traces within the set of traces in parallel, the execution 
taking place in an execution order that is based on the identified dependency order, at least two 
traces being executed in parallel and if the dependency order indicates that a second trace is 
dependent upon a first trace, the first trace being executed prior to the second trace; and 

multiple execution units to execute the traces in parallel 

a temporary buffer coupled to the execution units to store results associated with 
execution of multiple traces of the segment of code; 

a comparator circuit to detect whether an out-of-order memory dependency condition 
associated with parallel executed traces occurs at run-time of executing the multiple traces in 
parallel, the comparator circuit conditionally loading the results stored in the temporary buffer to 
memory after the particular trace completes execution; and 

a squash circuit coupled to receive a signal from the comparator circuit identifying 
detection of an out-of-order memory dependency condition, the squash circuit: 

squashing execution of latter traces in the segment of code that depend on results 
from earlier traces; and 

clearing results in the temporary buffer associated with the squashed traces. 

30. (canceled) 

3 1 . (previously presented) The trace processor as in claim 29 wherein the squash circuit 
generates a signal to the trace scheduler to reschedule squashed traces for later execution. 

32. (previously presented) The trace processor as in claim 29 further comprising: 

a basic block trace table cache to store trace information associated with a 
currently executed method. 
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33. (previously presented) The trace processor as in claim 32 further comprising: 

bytecode trace fetch logic that utilizes multiple program counters stored in the 
basic block trace table cache to order the fetcher to fetch multiple instructions from multiple 
locations of a method cache. 

34. (previously presented) The trace processor as in claim 32 further comprising: 

a decoded bytecode trace buffer including individual buffers, each individual 
buffer storing instructions for a given trace. 

35. (previously presented) The trace processor as in claim 34, wherein the trace scheduler: 

identifies non-dependent traces based on the trace information in the basic block 
trace table cache; 

selects the set of traces to be executed on corresponding execution units; 
allocates execution units to execute the set of traces in parallel; and 

fetches the set of traces from the decoded bytecode trace buffer for parallel 
execution by the execution units. 

36. (previously presented) The trace processor as in claim 29, wherein each execution unit 
includes an operand stack, a reservation station and an associated functional unit. 

37. (previously presented) The trace processor as in claim 29, wherein each execution unit 
includes multiple sets of shared local variable registers, and a set of local variable registers being 
utilized by a corresponding method. 

38. (previously presented) The trace processor as in claim 29, wherein each execution unit 
includes: 

a load buffer and a store buffer to temporarily store retrieved and modified data 
associated with multiple parallel executed traces in a scratchpad area. 

- END - 
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Allowable Subject Matter 

7. The following is an examiner's statement of reasons for allowance: 

Applicant filed Request for Continued Examination on 01/14/2009 to amend the 
Specification to remove the claim of priority to U.S. Application Serial No. 10/406,709 filed on 
April 3, 2003. Applicant submitted claims 1-6, 8-19, 21-29 and 31-38. Thus, as applicant pointed 
out under Remark section, pages 17-21, Vajapeyam et al. (US 2003/0126408) taken either singly 
and/or in combination with other cited prior arts, do not teach the combined functional 
limitations of a dependency detector that, prior to parallel execution of multiple identified traces 
on corresponding execution units, analyzes the traces identified in the segment of code to 
determine a dependency order for executing the traces, the dependency order identifying at least 
one of the traces associated with the segment of code that cannot be properly executed in parallel 
with another trace in the segment of code; a trace scheduler coupled to the dependency detector 
and the trace detector, the trace scheduler receiving a set of traces and, based on the dependency 
order, causing the corresponding execution units to execute traces within the set of traces in 
parallel, the execution taking place in an execution order that is based on the identified 
dependency order, at least two traces being executed in parallel and if the dependency order 
indicates that a second trace is dependent upon a first trace, the first trace being executed prior to 
the second trace; and multiple execution units to execute the traces in parallel a temporary buffer 
coupled to the execution units to store results associated with execution of multiple traces of the 
segment of code; a comparator circuit to detect whether an out-of-order memory dependency 
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condition associated with parallel executed traces occurs at run-time of executing the multiple 
traces in parallel, the comparator circuit conditionally loading the results stored in the temporary 
buffer to memory after the particular trace completes execution; and a squash circuit coupled to 
receive a signal from the comparator circuit identifying detection of an out-of-order memory 
dependency condition, the squash circuit: squashing execution of latter traces in the segment of 
code that depend on results 

from earlier traces; and clearing results in the temporary buffer associated with the squashed 
traces , as recited in such manners in each of independent claims 1,14, 27, 28 and 29. 

Prior arts of record do not teach and/or suggest these claimed limitations, thus, claims 1- 
6, 8-19, 21-29 and 31-38 are allowed. 

Any comments considered necessary by applicant must be submitted no later than the 
payment of the issue fee and, to avoid processing delays, should preferably accompany the issue 
fee. Such submissions should be clearly labeled "Comments on Statement of Reasons for 
Allowance." 

Conclusion 

8. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to ISAAC T. TECKLU whose telephone number is (571)272-7957. 
The examiner can normally be reached on M-TH 9:300A - 8:00P. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Tuan Q. Dam can be reached on (571) 272-3695. The fax phone number for the 
organization where this application or proceeding is assigned is 571-273-8300. 
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Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published applications 
may be obtained from either Private PAIR or Public PAIR. Status information for unpublished 
applications is available through Private PAIR only. For more information about the PAIR 
system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). If you would 
like assistance from a USPTO Customer Service Representative or access to the automated 
information system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 

/Isaac T Tecklu/ /Tuan Q. Dam/ 
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